One-Class Classification by Combining Density and Class Probability Estimation

نویسندگان

  • Kathryn Hempstalk
  • Eibe Frank
  • Ian H. Witten
چکیده

One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using either density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we present a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution, with the induction of a standard model for class probability estimation. In our method, the reference distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form an adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem, we show that the combined model, consisting of both a density estimator and a class probability estimator, can improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classifier-Adjusted Density Estimation for Anomaly Detection and One-Class Classification

Density estimation methods are often regarded as unsuitable for anomaly detection in high-dimensional data due to the difficulty of estimating multivariate probability distributions. Instead, the scores from popular distanceand localdensity-based methods, such as local outlier factor (LOF), are used as surrogates for probability densities. We question this infeasibility assumption and explore a...

متن کامل

Accounting for secondary variable for the classification of mineral resources using co-kriging technique; a Case study of Sarcheshmeh porphyry copper deposit

Due to substantial effect of classification of resource models on future mine planning, one should come with an accurate method of estimation to guarantee that the minimum error is acquired in the estimation process. The known world class Cu-Mo deposit, Sarcheshmeh Porphyry deposit (central Iran) selected as the study area. The Hypogene zone of the deposit was chosen as the space in which estim...

متن کامل

Analysis of a Fusion Method for Combining Marginal Classifiers

The use of multiple features by a classifier often leads to a reduced probability of error, but the design of an optimal Bayesian classifier for multiple features is dependent on the estimation of multidimensional joint probability density functions and therefore requires a design sample size that, in general, increases exponentially with the number of dimensions. The classification method desc...

متن کامل

Studying Effectiveness of Landsat ETM+ Satellite Images Classification Methods in Identification of desert pavements (Case study: South of Semnan)

Extended abstract 1- Introduction The process of identifying landforms is a subject that has been researched by many researchers. All the definitions of geomorphology emphasize the study and identification of landforms. Understanding landforms and how they are distributed are some sort of essential requirements in applied geomorphology and other environmental sciences (Shayan et al., 2012). O...

متن کامل

Overriding the Experts: A Fusion Method for Combining Marginal Classifiers

The design of an optimal Bayesian classifier for multiple features is dependent on the estimation of multidimensional joint probability density functions and therefore requires a design sample size that increases exponentially with the number of dimensions. A method was developed that combines classification decisions from marginal density functions using an additional classifier. Unlike voting...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008